We establish an information theoretic inequality concerning the binary skew-symmetric 
broadcast channel that was conjectured by one of the authors. This inequality helps to quantify 
the gap between the sum rate obtained by the inner bound and outer bound for the binary 
skew-symmetric broadcast channel. 

1 Introduction 

The broadcast channel is a fundamental network information theory setting modeling the
communication of messages (private and common) from a single sender to multiple receivers. For formal
definitions and early prior work the reader is referred to [1, 2]. There has been some recent progress
for the discrete memoryless setting, and this work establishes a conjecture proposed in one of the
recent papers [7].

We consider the broadcast channel where sender $X$ wishes to communicate independent
messages $M_1, M_2$ to two receivers $Y_1, Y_2$. The capacity region for the broadcast channel is an open
problem and the best known achievable region is due to Marton [8] and is presented below.

Bound 1. [8] The following region is achievable:

$$\begin{aligned}
R_1 &\le I(U, W; Y_1) \\
R_2 &\le I(V, W; Y_2) \\
R_1 + R_2 &\le I(U, W; Y_1) + I(V; Y_2 | W) - I(U; V | W) \\
R_1 + R_2 &\le I(V, W; Y_2) + I(U; Y_1 | W) - I(U; V | W)
\end{aligned}$$

for any triple of random variables $p(u,v,w)$ such that $(U,V,W) \to X \to (Y_1, Y_2)$ form a Markov
chain.

Capacity regions have been established for a number of special cases, and in every case where
capacity is known, the following outer bound and Marton's inner bound yield the same region.

Bound 2. The union of rate pairs

$$\begin{aligned}
R_1 &\le I(U; Y_1) \\
R_2 &\le I(V; Y_2) \\
R_1 + R_2 &\le I(U; Y_1) + I(V; Y_2 | U) \\
R_1 + R_2 &\le I(V; Y_2) + I(U; Y_1 | V)
\end{aligned}$$

over pairs of random variables $p(u,v)$ such that $(U, V) \to X \to (Y_1, Y_2)$ form a Markov chain
constitutes an outer bound to the capacity region.
In [6] the authors studied Bound 2 for the binary skew-symmetric channel and showed that the
line segment$^1$ of $R_1 + R_2 = 0.3725\ldots$ lies on the boundary of the outer bound. In [7] the authors
studied Marton's inner bound for the binary skew-symmetric broadcast channel and showed that,
provided an information theoretic inequality (Conjecture 1) holds, a line segment of
$R_1 + R_2 = 0.3616\ldots$ lies on the boundary of Marton's inner bound.

Conjecture 1. [7] For the binary skew-symmetric channel shown in Figure 1,

$$I(U; Y_1) + I(V; Y_2) - I(U; V) \le \max\{I(X; Y_1), I(X; Y_2)\}$$

for all $(U, V, X)$ such that $(U, V) \to X \to (Y_1, Y_2)$ forms a Markov chain.

It should be noted that this inequality was established in [3] when $U, V$ were independent, and
in [7] for dependent $U, V$ and $P(X = 0) \in [0, \frac{1}{5}] \cup [\frac{4}{5}, 1]$.
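The inequality can be probed numerically. The following is a minimal sketch, not the authors' code. It assumes the BSSC transition probabilities implied by the expressions used later in the paper ($I(X;Y_1) = h(x/2) - x$ with $x = P(X=0)$): $X=0$ makes $Y_1$ uniform and $Y_2$ deterministic, while $X=1$ makes $Y_1$ deterministic and $Y_2$ uniform. The names `conjecture_sides` and `random_pmf` are illustrative only.

```python
import math
import random

# Assumed BSSC transition matrices P(y | x), inferred from I(X;Y1) = h(x/2) - x.
P_Y1_GIVEN_X = [[0.5, 0.5], [0.0, 1.0]]
P_Y2_GIVEN_X = [[1.0, 0.0], [0.5, 0.5]]


def mutual_information(joint):
    """I(A;B) in bits for a joint pmf given as a nested list joint[a][b]."""
    pa = [sum(row) for row in joint]
    pb = [sum(col) for col in zip(*joint)]
    total = 0.0
    for a, row in enumerate(joint):
        for b, pab in enumerate(row):
            if pab > 0:
                total += pab * math.log2(pab / (pa[a] * pb[b]))
    return total


def conjecture_sides(p_uvx):
    """Return (LHS, RHS) of Conjecture 1 for a pmf p_uvx[u][v][x] on {0,1}^3."""
    bits = (0, 1)
    p_x = [sum(p_uvx[u][v][x] for u in bits for v in bits) for x in bits]
    # Joint pmfs of (U, Y1), (V, Y2), (U, V), (X, Y1), (X, Y2).
    uy1 = [[sum(p_uvx[u][v][x] * P_Y1_GIVEN_X[x][y] for v in bits for x in bits)
            for y in bits] for u in bits]
    vy2 = [[sum(p_uvx[u][v][x] * P_Y2_GIVEN_X[x][y] for u in bits for x in bits)
            for y in bits] for v in bits]
    uv = [[sum(p_uvx[u][v]) for v in bits] for u in bits]
    xy1 = [[p_x[x] * P_Y1_GIVEN_X[x][y] for y in bits] for x in bits]
    xy2 = [[p_x[x] * P_Y2_GIVEN_X[x][y] for y in bits] for x in bits]
    lhs = mutual_information(uy1) + mutual_information(vy2) - mutual_information(uv)
    rhs = max(mutual_information(xy1), mutual_information(xy2))
    return lhs, rhs


def random_pmf(rng):
    """Draw a random pmf on (U, V, X) with binary alphabets."""
    weights = [rng.random() for _ in range(8)]
    total = sum(weights)
    it = iter(w / total for w in weights)
    return [[[next(it) for _ in range(2)] for _ in range(2)] for _ in range(2)]


if __name__ == "__main__":
    rng = random.Random(0)
    worst = max(l - r for l, r in (conjecture_sides(random_pmf(rng)) for _ in range(1000)))
    print("largest LHS - RHS over random samples:", worst)
```

A random search of this kind cannot prove the inequality; it only illustrates what the two sides are and gives a quick sanity check of the channel model assumed above.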

The outline of the proof is as follows. (Parts 1 and 2 were established in [3] and are presented
for completeness.)

1. We show that to obtain the maximum sum-rate in Marton's region it is sufficient to consider 
X as a function of the auxiliary random variables U and V. 

2. We further show that the cardinality of $U$ and $V$ can be restricted to $|\mathcal{X}|$, which in this case
is 2.

3. As $U$ and $V$ are binary, we conclude that the conjecture is true iff it holds true for each of
the 16 cases where $X = f(U,V)$. We further prove that it is sufficient to consider only the
two cases $X = U \wedge V$ and $X = U \oplus V$ by showing that each of the other cases is either immediate
or reduces to one of these two cases.

4. We prove the conjecture in the $X = U \wedge V$ case using a multiplicative perturbation to derive
properties of the distribution on $(U, V)$ that achieves the maximum of the left-hand side of Conjecture 1.

5. Similarly, we prove the conjecture in the $X = U \oplus V$ case using an additive perturbation to derive
properties of the distribution on $(U, V)$ that achieves the maximum of the left-hand side of Conjecture 1.

1 There is a typo in the evaluation of this bound in the original paper though the main results are right. In the 
Appendix we will show the corrected derivation of this bound. 
Gohari and Anantharam [3] established bounds on the cardinalities of the auxiliary random
variables needed to evaluate Marton's achievable region. In this section, we present a modified
version of their arguments for completeness.
Define the following three quantities: 

• $M = \max\big(I(U; Y_1) + I(V; Y_2) - I(U; V)\big)$ over all $(U, V, X)$ such that $(U, V) \to X \to (Y_1, Y_2)$
forms a Markov chain.

• $M_d = \max\big(I(U; Y_1) + I(V; Y_2) - I(U; V)\big)$ over all $(U, V, X)$ such that $(U, V) \to X \to (Y_1, Y_2)$
forms a Markov chain and $X = f(U, V)$.

• $M_d^{|\mathcal{X}|} = \max\big(I(U; Y_1) + I(V; Y_2) - I(U; V)\big)$ over all $(U, V, X)$ such that $(U, V) \to X \to (Y_1, Y_2)$
forms a Markov chain; $|\mathcal{U}| \le |\mathcal{X}|$, $|\mathcal{V}| \le |\mathcal{X}|$, and $X = f(U, V)$.

We will show that $M = M_d = M_d^{|\mathcal{X}|}$ for any discrete-memoryless broadcast channel.
Fact 1. $M = M_d$.

Proof. Using standard arguments, e.g. [1], there exists a random variable $W$ independent of $(U, V)$
such that $X = f(U, V, W)$. Now set $V' = (V, W)$ and observe that

$$I(U; Y_1) + I(V; Y_2) - I(U; V) \le I(U; Y_1) + I(V'; Y_2) - I(U; V'),$$

since $I(V; Y_2) \le I(V'; Y_2)$ and $I(U; V') = I(U; V)$ by the independence of $W$ and $(U, V)$. □

Remark 1. One way to construct such a $W$ is the following$^2$: For every $(u, v)$ consider the sequence
$t_i(u,v) = P(X \le i \,|\, U = u, V = v)$, $1 \le i \le |\mathcal{X}|$. Mark the points $t_1(u, v), \ldots, t_{|\mathcal{X}|}(u, v)\ (= 1)$ for
all choices of $(u, v)$ along the unit interval $[0,1]$. The points define intervals (at most $|\mathcal{U}||\mathcal{V}||\mathcal{X}|$),
and we generate $W$ as an independent random variable with probabilities defined by the lengths of the
intervals. As the event $X = i$ given $U = u, V = v$ can be thought of as $W$ falling in a certain consecutive
set of appropriately chosen intervals, there is a natural mapping $(U, V, W) \mapsto X$.

Claim 1. $M_d = M_d^{|\mathcal{X}|}$.

Proof. This is a simplified version of the arguments of Gohari and Anantharam [3], adapted to
this setting. For a given $p(u, v, x)$ consider the multiplicative Lyapunov perturbation defined by
$q(u, v, x) = p(u, v, x)(1 + \epsilon L(u))$. For $q(u, v, x)$ to be a valid probability distribution we require the
following two conditions: $1 + \epsilon L(u) \ge 0$ for all $u$, and $\sum_u p(u)L(u) = 0$. Note: if $p(u,v,x) = 0$ then
$q(u, v, x) = 0$, and hence $X$ continues to be a function of $(U, V)$ under any such perturbation.

If the distribution $p(u,v,x)$ maximizes $I(U;Y_1) + I(V;Y_2) - I(U;V)$ then we must have that for
any valid perturbation

1. $\frac{\partial}{\partial \epsilon}\big[I_q(U; Y_1) + I_q(V; Y_2) - I_q(U; V)\big] = 0$,

2. $\frac{\partial^2}{\partial \epsilon^2}\big[I_q(U; Y_1) + I_q(V; Y_2) - I_q(U; V)\big] \le 0$.

Consider a class of perturbations $L(u)$ such that

$$E(L \,|\, X = x) = \sum_u p(u|x) L(u) = 0 \quad \text{for all } x. \tag{1}$$

Observe that these perturbations keep the distribution of $X$ (and hence of $Y_1, Y_2$) unchanged.



2 This construction was mentioned to one of the authors by Bruce Hajek. 
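The interval construction of Remark 1 can be sketched in code. This is an illustrative sketch only: the function name `build_w_and_map` and the dictionary-based input format are assumptions, and exact rational arithmetic is used so that the breakpoints on $[0,1]$ compare exactly.

```python
from fractions import Fraction


def build_w_and_map(cond_pmf):
    """Build W and the map (u, v, w) -> x from conditional pmfs of X.

    cond_pmf[(u, v)] is a list of Fractions P(X = i | u, v), i = 0..|X|-1.
    Returns (w_probs, f): w_probs[k] is P(W = k) and f[(u, v, k)] is the value of X.
    """
    # Collect every cumulative breakpoint t_i(u, v) on the unit interval.
    cuts = {Fraction(0), Fraction(1)}
    cdfs = {}
    for uv, pmf in cond_pmf.items():
        running, cdf = Fraction(0), []
        for prob in pmf:
            running += prob
            cdf.append(running)
            cuts.add(running)
        cdfs[uv] = cdf
    edges = sorted(cuts)
    # W indexes the intervals between consecutive breakpoints; its pmf is their lengths.
    w_probs = [hi - lo for lo, hi in zip(edges, edges[1:])]
    f = {}
    for (u, v), cdf in cdfs.items():
        for k, hi in enumerate(edges[1:]):
            # Interval k ends at hi; X is the first symbol whose cdf value reaches hi.
            f[(u, v, k)] = next(i for i, t in enumerate(cdf) if hi <= t)
    return w_probs, f


if __name__ == "__main__":
    example = {
        (0, 0): [Fraction(1, 3), Fraction(2, 3)],
        (0, 1): [Fraction(1, 2), Fraction(1, 2)],
        (1, 0): [Fraction(1), Fraction(0)],
        (1, 1): [Fraction(1, 4), Fraction(3, 4)],
    }
    w_probs, f = build_w_and_map(example)
    print("P(W):", w_probs)
```

Because $W$ is drawn independently of $(U, V)$ and each symbol of $X$ corresponds to a consecutive run of intervals, summing `w_probs` over the intervals mapped to $x$ recovers $P(X = x \,|\, u, v)$.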
Observation 1. There exists such a non-zero perturbation if $|\mathcal{U}| > |\mathcal{X}|$, since the linear
constraints in (1) have rank at most $|\mathcal{X}|$ and hence a non-trivial null-space.

Observe that

$$\begin{aligned}
& I_q(U; Y_1) + I_q(V; Y_2) - I_q(U; V) \\
&= H_q(Y_1) + H_q(Y_2) + H_q(U, V) - H_q(U, Y_1) - H_q(V, Y_2) \\
&= H_p(Y_1) + H_p(Y_2) + H_q(U, V) - H_q(U, Y_1) - H_q(V, Y_2) \\
&= H_p(Y_1) + H_p(Y_2) + H_p(U, V) + \epsilon H^L_p(U, V) - H_p(U, Y_1) - \epsilon H^L_p(U, Y_1) - H_q(V, Y_2).
\end{aligned}$$

Here $H^L_p(U,V) = -\sum_{u,v} p(u,v)L(u) \log p(u,v)$ and $H^L_p(U,Y_1) = -\sum_{u,y_1} p(u,y_1)L(u) \log p(u,y_1)$.

Therefore, $\frac{\partial^2}{\partial \epsilon^2}\big[I_q(U; Y_1) + I_q(V; Y_2) - I_q(U; V)\big] \le 0$ implies $\frac{\partial^2}{\partial \epsilon^2} H_q(V, Y_2) \ge 0$, and this implies

$$E\big(E(L|V,Y_2)^2\big) \le 0,$$

or in particular $E(L|V,Y_2) = 0$ whenever $p(v,y_2) \ne 0$. This, in turn, implies

$$H_q(V,Y_2) = H_p(V,Y_2).$$

Using this we obtain

$$\begin{aligned}
& I_q(U; Y_1) + I_q(V; Y_2) - I_q(U; V) \\
&= H_p(Y_1) + H_p(Y_2) + H_p(U, V) + \epsilon H^L_p(U, V) - H_p(U, Y_1) - \epsilon H^L_p(U, Y_1) - H_q(V, Y_2) \\
&= H_p(Y_1) + H_p(Y_2) + H_p(U, V) + \epsilon H^L_p(U, V) - H_p(U, Y_1) - \epsilon H^L_p(U, Y_1) - H_p(V, Y_2).
\end{aligned}$$

The first derivative being zero implies $H^L_p(U, V) - H^L_p(U, Y_1) = 0$, and this finally implies that
if $p(u,v,x)$ attains the maximum of $I(U;Y_1) + I(V;Y_2) - I(U;V)$ then $I_q(U;Y_1) + I_q(V;Y_2) -
I_q(U; V) = I_p(U; Y_1) + I_p(V; Y_2) - I_p(U; V)$ for any valid perturbation that satisfies (1).

Now we choose $\epsilon$ such that $\min_u \big(1 + \epsilon L(u)\big) = 0$, and let $u = u^*$ achieve this minimum. Observe
that $q(u^*) = 0$ and hence there exists a $U$ with cardinality at most $|\mathcal{U}| - 1$ such that
$I(U;Y_1) + I(V;Y_2) - I(U;V)$ is unchanged. We can proceed by induction until $|\mathcal{U}| = |\mathcal{X}|$. Observe
that when $|\mathcal{U}| = |\mathcal{X}|$, we are no longer guaranteed the existence of a non-trivial $L(u)$ satisfying (1).

The argument can then be repeated for $V$ to make $|\mathcal{V}| \le |\mathcal{X}|$ as well.

This completes the proof that $M_d = M_d^{|\mathcal{X}|}$. □

Remark 2. By Fact 1 and Claim 1, to prove Conjecture 1 it suffices to consider binary $U, V$
and $X = f(U,V)$. There are 16 possible boolean functions on binary $(U, V)$ and we establish the
conjecture for each such function.

We use the following notation: $U \wedge V$ (and), $U \vee V$ (or), $U \oplus V$ (xor), $\bar{U}$ (not).

Observation 2. The functions in each of the following groups are equivalent up to re-labeling (of either
$U$ or $V$ or both):

• $X = U$, $X = \bar{U}$,

• $X = V$, $X = \bar{V}$,

• $X = U \wedge V$, $X = \bar{U} \wedge V$, $X = U \wedge \bar{V}$, $X = \bar{U} \wedge \bar{V}$,

• $X = U \vee V$, $X = \bar{U} \vee V$, $X = U \vee \bar{V}$, $X = \bar{U} \vee \bar{V}$,

• $X = U \oplus V$, $X = \overline{U \oplus V}$.

Claim 2. The conjecture is valid when $X = 0$, $X = 1$, $X = U$, $X = V$.

Proof. In the first two cases, the conjecture reduces to $-I(U; V) \le 0$ (true by non-negativity
of mutual information). In the third case the conjecture follows from the data processing inequality, as
$I(V;Y_2) \le I(V;X)$ and hence $I(X;Y_1) + I(V;Y_2) - I(V;X) \le I(X;Y_1)$. The fourth case follows
in a similar manner as the third. □

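Claim 3. The conjecture is valid for all distributions $p(u, v)$ when $X = U \wedge V$ if and only if the
conjecture is valid for all distributions $q(u, v)$ when $X = U \vee V$.

Proof. This follows from the skew-symmetry of the channel and the fact that $X = U \vee V$ is equivalent to
$\bar{X} = \bar{U} \wedge \bar{V}$. Let $P(U = i, V = j) = p_{ij}$ for every $i,j \in \{0, 1\}$. For concreteness, when $X = U \wedge V$
the conjecture is equivalent to

$$\begin{aligned}
& h\Big(\frac{p_{00} + p_{01} + p_{10}}{2}\Big) - (p_{00} + p_{01})\,h\Big(\frac12\Big) - (p_{10} + p_{11})\,h\Big(\frac{p_{10}}{2(p_{10}+p_{11})}\Big) + h\Big(\frac{p_{11}}{2}\Big) \\
& \quad - (p_{01} + p_{11})\,h\Big(\frac{p_{11}}{2(p_{01}+p_{11})}\Big) - h(p_{00} + p_{01}) + (p_{00} + p_{10})\,h\Big(\frac{p_{00}}{p_{00}+p_{10}}\Big) + (p_{01} + p_{11})\,h\Big(\frac{p_{01}}{p_{01}+p_{11}}\Big) \\
& \le \max\Big\{ h\Big(\frac{p_{00}+p_{01}+p_{10}}{2}\Big) - (p_{00} + p_{01} + p_{10})\,h\Big(\frac12\Big),\ h\Big(\frac{p_{11}}{2}\Big) - p_{11}\,h\Big(\frac12\Big) \Big\},
\end{aligned} \tag{2}$$

where $h(x) = -x\log_2(x) - (1 - x)\log_2(1 - x)$ represents the binary entropy function.

For the $X = U \vee V$ case, let $P(U = i, V = j) = q_{ij}$ for every $i,j \in \{0, 1\}$.
The conjecture is now equivalent to

$$\begin{aligned}
& h\Big(\frac{q_{00}}{2}\Big) - (q_{01} + q_{00})\,h\Big(\frac{q_{00}}{2(q_{01}+q_{00})}\Big) + h\Big(\frac{q_{11}+q_{01}+q_{10}}{2}\Big) - (q_{11} + q_{01})\,h\Big(\frac12\Big) \\
& \quad - (q_{10} + q_{00})\,h\Big(\frac{q_{10}}{2(q_{10}+q_{00})}\Big) - h(q_{11} + q_{01}) + (q_{11} + q_{10})\,h\Big(\frac{q_{11}}{q_{11}+q_{10}}\Big) + (q_{01} + q_{00})\,h\Big(\frac{q_{01}}{q_{01}+q_{00}}\Big) \\
& \le \max\Big\{ h\Big(\frac{q_{11}+q_{01}+q_{10}}{2}\Big) - (q_{11} + q_{01} + q_{10})\,h\Big(\frac12\Big),\ h\Big(\frac{q_{00}}{2}\Big) - q_{00}\,h\Big(\frac12\Big) \Big\}.
\end{aligned} \tag{3}$$

The bijection $p_{00} \leftrightarrow q_{11}$, $p_{01} \leftrightarrow q_{01}$, $p_{10} \leftrightarrow q_{10}$, $p_{11} \leftrightarrow q_{00}$ completes the proof of the equivalence
of the conjectures under the constraints $X = U \wedge V$ and $X = U \vee V$. □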
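The relabeling used in Claim 3 can be checked numerically. The sketch below is illustrative only: it assumes the BSSC transition probabilities implied by the formulas in (2) ($X=0$ makes $Y_1$ uniform and $Y_2$ deterministic, and the reverse for $X=1$), and the helper names `sides` and `relabel` are hypothetical.

```python
import math

# Assumed BSSC transition matrices P(y | x), consistent with the terms in (2).
P_Y1_GIVEN_X = [[0.5, 0.5], [0.0, 1.0]]
P_Y2_GIVEN_X = [[1.0, 0.0], [0.5, 0.5]]
BITS = (0, 1)


def mutual_information(joint):
    """I(A;B) in bits for a joint pmf joint[a][b]."""
    pa = [sum(row) for row in joint]
    pb = [sum(col) for col in zip(*joint)]
    return sum(pab * math.log2(pab / (pa[a] * pb[b]))
               for a, row in enumerate(joint)
               for b, pab in enumerate(row) if pab > 0)


def sides(p_uv, func):
    """(LHS, RHS) of the conjecture when X = func(U, V) and (U, V) ~ p_uv."""
    uy1 = [[sum(p_uv[u][v] * P_Y1_GIVEN_X[func(u, v)][y] for v in BITS)
            for y in BITS] for u in BITS]
    vy2 = [[sum(p_uv[u][v] * P_Y2_GIVEN_X[func(u, v)][y] for u in BITS)
            for y in BITS] for v in BITS]
    p_x = [sum(p_uv[u][v] for u in BITS for v in BITS if func(u, v) == x)
           for x in BITS]
    xy1 = [[p_x[x] * P_Y1_GIVEN_X[x][y] for y in BITS] for x in BITS]
    xy2 = [[p_x[x] * P_Y2_GIVEN_X[x][y] for y in BITS] for x in BITS]
    lhs = mutual_information(uy1) + mutual_information(vy2) - mutual_information(p_uv)
    rhs = max(mutual_information(xy1), mutual_information(xy2))
    return lhs, rhs


def relabel(p):
    """Apply the bijection p00 <-> q11, p01 <-> q01, p10 <-> q10, p11 <-> q00."""
    return [[p[1][1], p[0][1]], [p[1][0], p[0][0]]]


if __name__ == "__main__":
    p = [[0.1, 0.2], [0.3, 0.4]]
    print(sides(p, lambda u, v: u & v))
    print(sides(relabel(p), lambda u, v: u | v))
```

Both calls should print the same pair of values, which is the content of the equivalence between (2) and (3).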

Corollary 1. From Remark 2, Observation 2, and Claims 2 and 3, it follows that the conjecture is true
provided it holds when $X = U \wedge V$ and $X = U \oplus V$.



2.1 Case 1: $X = U \wedge V$

We prove the conjecture in this case by studying the local maxima. Clearly the conjecture is true
when two of the three terms $p_{00}, p_{01}, p_{10}$ are identically 0. When this happens, the condition
reduces to $X = U, V = 1$; $X = V, U = 1$; or $U = V = X$, each of which is covered by Claim 2.
Clearly if $p_{11} = 0$ then $X = 0$, in which case the conjecture is valid. So we assume that $p_{11} > 0$.
Therefore, we only establish the validity of the conjecture for the remaining cases.

Consider a perturbation $q(u,v,x) = p(u,v,x)(1 + \epsilon L(u,v))$ that maintains $P(X = 0)$. This
implies that the perturbation satisfies

$$L_{11} = 0, \qquad p_{00}L_{00} + p_{01}L_{01} + p_{10}L_{10} = 0. \tag{4}$$
For any local maximum of $I(U; Y_1) + I(V; Y_2) - I(U; V)$, the derivative with respect to $\epsilon$ must be
zero for all perturbations satisfying (4), i.e.

$$H_L(U,V) = H_{E(L|U,Y_1)}(U,Y_1) + H_{E(L|V,Y_2)}(V,Y_2). \tag{5}$$

The terms $H_L(U,V)$, $H_{E(L|U,Y_1)}(U,Y_1)$, $H_{E(L|V,Y_2)}(V,Y_2)$ correspond to

$$\begin{aligned}
H_L(U, V) &= -p_{00}L_{00} \log p_{00} - p_{10}L_{10} \log p_{10} - p_{01}L_{01} \log p_{01}, \\
H_{E(L|U,Y_1)}(U, Y_1) &= -(p_{00}L_{00} + p_{01}L_{01}) \log\Big(\frac{p_{00}+p_{01}}{2}\Big) - \frac{p_{10}L_{10}}{2} \log\Big(\frac{p_{10}}{2}\Big) - \frac{p_{10}L_{10}}{2} \log\Big(\frac{p_{10}}{2} + p_{11}\Big), \\
H_{E(L|V,Y_2)}(V, Y_2) &= -(p_{00}L_{00} + p_{10}L_{10}) \log(p_{00} + p_{10}) - p_{01}L_{01} \log\Big(p_{01} + \frac{p_{11}}{2}\Big).
\end{aligned}$$

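2.1.1 Case 1.1: $p_{00}, p_{01}, p_{10}, p_{11} > 0$

In this case the conditions (4) and (5) imply that the following equalities hold:

$$\frac{p_{00}}{p_{01}} = \frac{p_{00} + p_{10}}{p_{01} + \frac{p_{11}}{2}}, \qquad
\frac{p_{00}}{p_{10}} = \frac{p_{00} + p_{01}}{\sqrt{p_{10}(p_{10} + 2p_{11})}}.$$

These conditions are obtained by setting $L_{10} = 0$ and $L_{01} = 0$ respectively.
The above two conditions imply that

$$\frac{p_{01}}{p_{00}} = 2, \qquad \frac{p_{11}}{p_{10}} = 4.$$

These two equalities along with $p_{00} + p_{01} + p_{10} + p_{11} = 1$ imply that any non-trivial local maximum
is of the form$^3$

$$p_{00} = \frac{1-t}{3}, \quad p_{01} = \frac{2(1-t)}{3}, \quad p_{10} = \frac{t}{5}, \quad p_{11} = \frac{4t}{5}.$$

We need to verify the conjecture at this point. It suffices to show that

$$I(U; Y_1) + I(V; Y_2) - I(U; V) \le I(X; Y_1) \quad \big(\le \max\{I(X; Y_1), I(X; Y_2)\}\big).$$

This is equivalent to showing (for $0 \le t \le 1$)

$$\begin{aligned}
0 &\le H(Y_1|U) - H(Y_1|X) - H(V|U) + H(V|Y_2) \\
&= (1-t) + t\,h\Big(\frac{1}{10}\Big) - \Big(1 - \frac{4t}{5}\Big) - (1-t)\,h\Big(\frac13\Big) - t\,h\Big(\frac15\Big) + \Big(1 - \frac{2t}{5}\Big)h\Big(\frac13\Big) \\
&= t\Big(-\frac15 + h\Big(\frac{1}{10}\Big) - h\Big(\frac15\Big) + \frac35\,h\Big(\frac13\Big)\Big) \\
&= t\Big(\frac35\,h\Big(\frac13\Big) - \frac{9}{10}\,h\Big(\frac19\Big)\Big),
\end{aligned}$$

where the last step uses the grouping identity $h(\frac{1}{10}) + \frac{9}{10}h(\frac19) = h(\frac15) + \frac15$. This is clearly
true as $\frac32 h(\frac19) < h(\frac13)$. This proves the validity of the conjecture when $p_{00}, p_{01}, p_{10} > 0$.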
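As a numerical cross-check of Case 1.1, the sketch below evaluates the gap $I(X;Y_1) - [I(U;Y_1) + I(V;Y_2) - I(U;V)]$ on the one-parameter family $p_{00} = (1-t)/3$, $p_{01} = 2(1-t)/3$, $p_{10} = t/5$, $p_{11} = 4t/5$, using the entropy expressions from (2), and compares it with the closed form $t\,(\frac35 h(\frac13) - \frac{9}{10} h(\frac19))$. The function names are illustrative, and the closed form is the simplification stated in this sketch's lead-in rather than a quoted formula.

```python
import math


def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def gap_at_stationary_point(t):
    """I(X;Y1) - [I(U;Y1) + I(V;Y2) - I(U;V)] at the Case 1.1 point, 0 < t < 1."""
    p00, p01, p10, p11 = (1 - t) / 3, 2 * (1 - t) / 3, t / 5, 4 * t / 5
    p_u1, p_v1, p_x0 = p10 + p11, p01 + p11, p00 + p01 + p10
    # Each term follows the corresponding group of terms in inequality (2).
    i_u_y1 = h(p_x0 / 2) - (p00 + p01) - p_u1 * h(p10 / (2 * p_u1))
    i_v_y2 = h(p11 / 2) - p_v1 * h(p11 / (2 * p_v1))
    i_u_v = h(p00 + p01) - (p00 + p10) * h(p00 / (p00 + p10)) - p_v1 * h(p01 / p_v1)
    i_x_y1 = h(p_x0 / 2) - p_x0
    return i_x_y1 - (i_u_y1 + i_v_y2 - i_u_v)


def closed_form_gap(t):
    """t * (3/5 h(1/3) - 9/10 h(1/9)), the simplified form of the same gap."""
    return t * (0.6 * h(1 / 3) - 0.9 * h(1 / 9))


if __name__ == "__main__":
    for t in (0.1, 0.5, 0.9):
        print(t, gap_at_stationary_point(t), closed_form_gap(t))
```

The two columns should agree to floating-point accuracy and be strictly positive for $0 < t < 1$.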



3 This local maximum exists only when $P(X = 1) = p_{11} < \frac45$, and hence there is no local maximum when $P(X = 1) > \frac45$.
When $P(X = 1) > \frac45$, there was a simple argument in [7] that established the conjecture. It is curious that both
approaches lead to a simple proof in this regime.
2.1.2 Case 1.2: $p_{01} = 0$; $p_{00}, p_{10}, p_{11} > 0$

In this case the conditions (4) and (5) imply that the following equality holds:

$$\frac{p_{00}}{p_{10}} = \frac{p_{00}}{\sqrt{p_{10}(p_{10} + 2p_{11})}}.$$

However this cannot hold if $p_{00}, p_{10}, p_{11} > 0$.

2.1.3 Case 1.3: $p_{10} = 0$; $p_{00}, p_{01}, p_{11} > 0$

In this case the conditions (4) and (5) imply that

$$\frac{p_{00}}{p_{01}} = \frac{p_{00}}{p_{01} + \frac{p_{11}}{2}}.$$

Again this cannot hold if $p_{00}, p_{01}, p_{11} > 0$.

2.1.4 Case 1.4: $p_{00} = 0$; $p_{10}, p_{01}, p_{11} > 0$

In this case the conditions (4) and (5) imply

$$\sqrt{p_{10}(p_{10} + 2p_{11})} = p_{01} + \frac{p_{11}}{2}. \tag{6}$$

To eliminate this possibility, we show that any point that satisfies (6) cannot be a local maximum.
Observe that for a local maximum one also requires $\frac{\partial^2}{\partial \epsilon^2}\big[I(U; Y_1) + I(V; Y_2) - I(U; V)\big] \le 0$, i.e.

$$E\big[E[L_{UV}|U, Y_1]^2\big] + E\big[E[L_{UV}|V, Y_2]^2\big] - E\big[E[L_{UV}|U, V]^2\big] \le 0.$$

Equivalently, for all perturbations satisfying $L_{11} = 0$ and $p_{01}L_{01} + p_{10}L_{10} = 0$, any local maximum
must satisfy

$$p_{01}L_{01}^2 + p_{10}L_{10}^2 \ge p_{01}L_{01}^2 + \frac12 p_{10}L_{10}^2 + \frac{p_{10}^2}{2(p_{10} + 2p_{11})}L_{10}^2 + p_{10}L_{10}^2 + \frac{p_{01}^2}{p_{01} + \frac{p_{11}}{2}}L_{01}^2,$$

which is clearly not possible when $p_{10}, p_{01}, p_{11} > 0$. This completes the proof of Case 1.

2.2 Case 2: $X = U \oplus V$

We again prove the conjecture in this case by studying the local maxima. As before, the conjecture
is true when two of the four terms $p_{00}, p_{01}, p_{10}, p_{11}$ are identically 0. When this happens, the
condition reduces to $X = \bar{U}, V = 1$; $X = \bar{V}, U = 1$; $X = 0$; $X = 1$; $X = U, V = 0$; or $X = V, U = 0$,
each of which is covered by Claim 2. Therefore, we only establish the validity of the conjecture for
the remaining cases.

Consider a perturbation$^4$ $q(u, v, x) = p(u, v, x) + \epsilon\lambda(u, v, x)$ for some $\epsilon > 0$. For this to be a
valid perturbation we require

$$\lambda_{001}, \lambda_{010}, \lambda_{100}, \lambda_{111} \ge 0 \tag{7}$$


4 Note that this perturbation is more general than the one we have used so far, the multiplicative
perturbation of the form $q(u, v, x) = p(u,v,x)(1 + \epsilon L(u,v,x))$. The multiplicative perturbation ensures that if $p(u, v, x) = 0$
then $q(u,v,x) = 0$; however an additive one need not preserve this. Setting $\lambda(u,v,x) = p(u,v,x)L(u,v,x)$ shows
that the multiplicative perturbation is a special case of the additive perturbation. It turns out that in the case
$X = U \oplus V$, the analysis of the local maxima is greatly simplified if we consider an additive perturbation, as we are
finding the local maxima over a possibly larger space.
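as the corresponding $p(u,v,x)$ are zero. Further let us require that the perturbation maintains
$P(X = 0)$. This implies that the perturbation satisfies

$$\lambda_{000} + \lambda_{010} + \lambda_{100} + \lambda_{110} = 0, \qquad
\lambda_{001} + \lambda_{011} + \lambda_{101} + \lambda_{111} = 0. \tag{8}$$

For any perturbation that satisfies (7) and (8), at any local maximum it must be true that the
first derivative cannot be positive. This implies

$$H_\lambda(U, V) - H_{E(\lambda|U,Y_1)}(U, Y_1) - H_{E(\lambda|V,Y_2)}(V, Y_2) \le 0, \tag{9}$$

where

$$\begin{aligned}
H_\lambda(U, V) ={}& -(\lambda_{001} + \lambda_{000}) \log p_{00} - (\lambda_{010} + \lambda_{011}) \log p_{01} - (\lambda_{100} + \lambda_{101}) \log p_{10} \\
& - (\lambda_{110} + \lambda_{111}) \log p_{11}, \\
H_{E(\lambda|U,Y_1)}(U, Y_1) ={}& -\frac{\lambda_{000} + \lambda_{010}}{2} \log\Big(\frac{p_{00}}{2}\Big) - \Big(\frac{\lambda_{000} + \lambda_{010}}{2} + \lambda_{001} + \lambda_{011}\Big) \log\Big(\frac{p_{00}}{2} + p_{01}\Big) \\
& - \frac{\lambda_{110} + \lambda_{100}}{2} \log\Big(\frac{p_{11}}{2}\Big) - \Big(\frac{\lambda_{110} + \lambda_{100}}{2} + \lambda_{101} + \lambda_{111}\Big) \log\Big(\frac{p_{11}}{2} + p_{10}\Big), \\
H_{E(\lambda|V,Y_2)}(V, Y_2) ={}& -\Big(\frac{\lambda_{001} + \lambda_{101}}{2} + \lambda_{100} + \lambda_{000}\Big) \log\Big(\frac{p_{10}}{2} + p_{00}\Big) - \frac{\lambda_{001} + \lambda_{101}}{2} \log\Big(\frac{p_{10}}{2}\Big) \\
& - \Big(\frac{\lambda_{011} + \lambda_{111}}{2} + \lambda_{010} + \lambda_{110}\Big) \log\Big(\frac{p_{01}}{2} + p_{11}\Big) - \frac{\lambda_{011} + \lambda_{111}}{2} \log\Big(\frac{p_{01}}{2}\Big).
\end{aligned}$$
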

2.2.1 Case 2.1: $p_{00}, p_{01}, p_{10}, p_{11} > 0$

Let $a, b, c, d \ge 0$ and let us choose $\lambda_{001} = a = -\lambda_{000}$, $\lambda_{100} = b = -\lambda_{101}$, $\lambda_{010} = c = -\lambda_{011}$, and
$\lambda_{111} = d = -\lambda_{110}$. Observe that this choice satisfies (7) and (8). Therefore from the constraint
(9), we must have for all choices of $a, b, c, d \ge 0$

$$0 \le \frac{a-c}{2} \log\frac{p_{00}}{p_{00} + 2p_{01}} + \frac{d-b}{2} \log\frac{p_{11}}{p_{11} + 2p_{10}}
+ \frac{b-a}{2} \log\frac{p_{10}}{p_{10} + 2p_{00}} + \frac{c-d}{2} \log\frac{p_{01}}{p_{01} + 2p_{11}}. \tag{10}$$

• Setting $a = c = k$, $b = d = l$ we require

$$\frac{l-k}{2} \log\frac{p_{10}(p_{01} + 2p_{11})}{p_{01}(p_{10} + 2p_{00})} \ge 0$$

for all $l, k \ge 0$, which is true if and only if

$$p_{10}p_{11} = p_{01}p_{00}. \tag{11}$$

• Setting $a = b = l$, $c = d = k$ we require

$$\frac{l-k}{2} \log\frac{p_{00}(p_{11} + 2p_{10})}{p_{11}(p_{00} + 2p_{01})} \ge 0$$

for all $l, k \ge 0$, which is true if and only if

$$p_{01}p_{11} = p_{10}p_{00}. \tag{12}$$

• Setting $a = d = l$, $b = c = k$ we require

$$(l - k) \log\Big(\frac{p_{00}}{p_{00} + 2p_{01}} \cdot \frac{p_{11}}{p_{11} + 2p_{10}} \cdot \frac{p_{10} + 2p_{00}}{p_{10}} \cdot \frac{p_{01} + 2p_{11}}{p_{01}}\Big) \ge 0 \tag{13}$$

for all $l, k \ge 0$. Observe that equations (11) and (12) imply that $p_{00} = p_{11}$ and $p_{01} = p_{10}$. Let
$p = p_{00} = p_{11}$, $q = p_{01} = p_{10}$. Substituting this choice into (13) implies that

$$(l-k) \log\Big(\frac{p}{q} \cdot \frac{q + 2p}{p + 2q}\Big)^2 \ge 0$$

for all $l, k \ge 0$, which is true if and only if $p = q$.

Therefore the only choice of $p_{00}, p_{01}, p_{10}, p_{11} > 0$ that satisfies the constraint (10) for all choices
of $a, b, c, d \ge 0$ is $p_{00} = p_{01} = p_{10} = p_{11} = \frac14$. In this case observe that $U$, $V$ and $X$ are
pairwise independent, and the conjecture is trivially true as $I(U; Y_1) + I(V; Y_2) - I(U; V) = 0$.

2.2.2 Case 2.2: One among $p_{00}, p_{01}, p_{10}, p_{11}$ is zero

All these cases are similar to each other and each reduces to a particular $X = U \wedge V$ case, and hence
the validity of the conjecture follows. For example, when $p_{00} = 0$, observe that $X = 0$ if and only
if $U = V = 1$. Therefore this can also be viewed as a special case of $\bar{X} = U \wedge V$. (Note that we
have already shown the equivalence between the $X = U \wedge V$ and $X = U \vee V$ cases.)

Condition        Equivalent $X = U \wedge V$ case
$p_{00} = 0$     $\bar{X} = U \wedge V$
$p_{01} = 0$     $X = U \wedge \bar{V}$
$p_{10} = 0$     $X = \bar{U} \wedge V$
$p_{11} = 0$     $\bar{X} = \bar{U} \wedge \bar{V}$

Since the conjecture was established when $X = U \wedge V$, this equivalence completes the proof
when $X = U \oplus V$. Thus Conjecture 1 is established.



3 Sum-rate evaluations of inner and outer bounds for BSSC 

We shall evaluate the inner and outer bounds for the BSSC from [6] and [7]. Apart from
completeness, this section serves three purposes:

• We present a proof that to compute the maximum sum-rate of Marton's achievable region
it suffices to restrict ourselves to $|\mathcal{W}| \le |\mathcal{X}|$.

• We correct a minor typo in the evaluation of the maximum sum-rate of the outer bound
presented in [6].

• We also compute the maximum sum-rate obtained via the Körner-Marton outer bound for
the BSSC.
3.1 On sum-rate evaluation of Marton's inner bound

Though this evaluation was done in [7], assuming the conjectured inequality, we present a slightly
different, albeit more general, argument that produces the same result. We first prove that for any
broadcast channel it suffices to restrict ourselves to $|\mathcal{W}| \le |\mathcal{X}|$ to compute the maximum sum-rate
of Marton's achievable region. In [7] we proved this fact using some properties of the BSSC,
and here we present a general argument.

Claim 4. For a discrete memoryless broadcast channel, to compute the maximum of

$$\lambda I(W; Y_1) + (1 - \lambda)I(W; Y_2) + I(U; Y_1|W) + I(V; Y_2|W) - I(U; V|W), \quad 0 \le \lambda \le 1,$$

over all choices of $(U, V, W) \to X \to (Y_1, Y_2)$ it suffices to restrict to $|\mathcal{W}| = |\mathcal{X}|$.

Proof. Let $p(u, v, w, x)$ achieve a maximum of the above expression. As before, we consider the
multiplicative Lyapunov perturbation defined by $q(u,v,w,x) = p(u,v,w,x)(1 + \epsilon L(w))$. For
$q(u, v, w, x)$ to be a valid probability distribution we require the conditions $1 + \epsilon L(w) \ge 0$ for all $w$
and $\sum_w p(w)L(w) = 0$. Further let us require that the perturbation maintains $P(X = x)$, that is

$$E(L|X = x) = \sum_w p(w|x)L(w) = 0. \tag{14}$$

Remark: There exists a nontrivial $L(w)$ if $|\mathcal{W}| > |\mathcal{X}|$.
Observe that

$$\begin{aligned}
& \lambda I_q(W; Y_1) + (1 - \lambda)I_q(W; Y_2) + I_q(U; Y_1|W) + I_q(V; Y_2|W) - I_q(U; V|W) \\
&= \lambda H_p(Y_1) + (1 - \lambda)H_p(Y_2) + \lambda\big(H_p(W, Y_2) + \epsilon H^L_p(W, Y_2)\big) + (1 - \lambda)\big(H_p(W, Y_1) + \epsilon H^L_p(W, Y_1)\big) \\
&\quad - H_p(U, W, Y_1) - \epsilon H^L_p(U, W, Y_1) - H_p(V, W, Y_2) - \epsilon H^L_p(V, W, Y_2) + H_p(U, V, W) + \epsilon H^L_p(U, V, W),
\end{aligned} \tag{15}$$

where

$$\begin{aligned}
H^L_p(W, Y_1) &= -\sum_{w,y_1} p(w,y_1)L(w) \log p(w,y_1), \\
H^L_p(W, Y_2) &= -\sum_{w,y_2} p(w,y_2)L(w) \log p(w,y_2), \\
H^L_p(U, V, W) &= -\sum_{u,v,w} p(u,v,w)L(w) \log p(u,v,w), \\
H^L_p(U, W, Y_1) &= -\sum_{u,w,y_1} p(u,w,y_1)L(w) \log p(u,w,y_1), \\
H^L_p(V, W, Y_2) &= -\sum_{v,w,y_2} p(v,w,y_2)L(w) \log p(v,w,y_2).
\end{aligned}$$

The first derivative with respect to $\epsilon$ being zero implies

$$\lambda H^L_p(W, Y_2) + (1 - \lambda)H^L_p(W, Y_1) - H^L_p(U, W, Y_1) - H^L_p(V, W, Y_2) + H^L_p(U, V, W) = 0.$$

Substituting this into (15) we obtain

$$\begin{aligned}
& \lambda I_q(W; Y_1) + (1 - \lambda)I_q(W; Y_2) + I_q(U; Y_1|W) + I_q(V; Y_2|W) - I_q(U; V|W) \\
&= \lambda I_p(W; Y_1) + (1 - \lambda)I_p(W; Y_2) + I_p(U; Y_1|W) + I_p(V; Y_2|W) - I_p(U; V|W)
\end{aligned}$$

for any valid perturbation that satisfies (14).

Now we choose $\epsilon$ such that $\min_w \big(1 + \epsilon L(w)\big) = 0$, and let $w = w^*$ achieve this minimum.
Observe that $q(w^*) = 0$ and hence there exists a $W$ with cardinality equal to $|\mathcal{W}| - 1$ such that
$\lambda I(W; Y_1) + (1 - \lambda)I(W; Y_2) + I(U; Y_1|W) + I(V; Y_2|W) - I(U; V|W)$ is preserved. We can proceed
by induction until $|\mathcal{W}| = |\mathcal{X}|$. This completes the proof of this claim. □
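The support-reduction step in this proof can be illustrated on a toy case. The sketch below assumes $|\mathcal{W}| = 3$ and $|\mathcal{X}| = 2$ with $p(w) > 0$ for all $w$, so that a direction $L(w)$ satisfying (14) can be obtained as the cross product of the two constraint rows; the function name `reduce_support` is hypothetical. It only demonstrates that the perturbation keeps the marginal of $X$ and removes one value of $W$; it does not evaluate the objective.

```python
def reduce_support(p_wx):
    """One perturbation step for a joint pmf p_wx[w][x] with |W| = 3 and |X| = 2.

    Returns q_wx[w][x] = p_wx[w][x] * (1 + eps * L(w)), where L satisfies
    sum_w p(w, x) L(w) = 0 for each x and eps is the largest step that keeps
    1 + eps * L(w) >= 0. Assumes p(w) > 0 for every w.
    """
    r0 = [p_wx[w][0] for w in range(3)]
    r1 = [p_wx[w][1] for w in range(3)]
    # The cross product is orthogonal to both rows, which is exactly condition (14)
    # written as sum_w p(w, x) L(w) = 0 for x = 0 and x = 1.
    L = [r0[1] * r1[2] - r0[2] * r1[1],
         r0[2] * r1[0] - r0[0] * r1[2],
         r0[0] * r1[1] - r0[1] * r1[0]]
    if all(abs(v) < 1e-15 for v in L):
        raise ValueError("constraint rows are parallel; pick L differently")
    # Since sum_w p(w) L(w) = 0 and L is non-zero, min(L) is negative.
    eps = -1.0 / min(L)
    return [[p_wx[w][x] * (1 + eps * L[w]) for x in range(2)] for w in range(3)]


if __name__ == "__main__":
    p = [[0.1, 0.2], [0.3, 0.1], [0.05, 0.25]]
    q = reduce_support(p)
    print("q(w, x):", q)
    print("P(X) before:", [sum(row[x] for row in p) for x in range(2)])
    print("P(X) after: ", [sum(row[x] for row in q) for x in range(2)])
```

In this example one row of `q` becomes zero while the marginal of $X$ is unchanged, which is the mechanism by which the induction lowers $|\mathcal{W}|$ one step at a time.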



3.1.1 Evaluation of the maximum sum-rate of Marton's region for BSSC 

Clearly from the above claim we can assume that $|\mathcal{W}| = 2$. Observe that for the BSSC, $I(X; Y_1) \ge
I(X; Y_2)$ if and only if $P(X = 0) \le \frac12$. If $0 \le P(X = 0|W = 0), P(X = 0|W = 1) \le \frac12$ it is easy to
see, using the established inequality, that

$$SR = \min\{I(W;Y_1), I(W;Y_2)\} + I(X;Y_1|W) \le I(X;Y_1) \le C,$$

where $C$ is the single channel capacity given by $h(0.2) - 0.4 \approx 0.321928$. Similarly if $\frac12 \le P(X =
0|W = 0), P(X = 0|W = 1) \le 1$, again the sum-rate will be bounded by $C$. Hence we can assume
that $0 \le P(X = 0|W = 0) \le \frac12 \le P(X = 0|W = 1) \le 1$.

Let $d = \max_{p(x)} I(X; Y_1) - I(X; Y_2)$. Then we can solve for $d = 0.10072952\ldots$, and the optimizing
choice is $P(X = 0) = 0.15843497\ldots$ Now observe that

$$\begin{aligned}
SR &\le I(W;Y_1) + P(W = 0)I(X;Y_1|W = 0) + P(W = 1)I(X;Y_2|W = 1) \\
&= I(X;Y_1) + P(W = 1)\big(I(X;Y_2|W = 1) - I(X;Y_1|W = 1)\big) \\
&\le I(X;Y_1) + P(W = 1)\,d.
\end{aligned}$$

Similarly

$$\begin{aligned}
SR &\le I(W; Y_2) + P(W = 0)I(X; Y_1|W = 0) + P(W = 1)I(X; Y_2|W = 1) \\
&= I(X; Y_2) + P(W = 0)\big(I(X; Y_1|W = 0) - I(X; Y_2|W = 0)\big) \\
&\le I(X;Y_2) + P(W = 0)\,d.
\end{aligned}$$

From these two (by adding them) we can deduce that

$$2\,SR \le I(X;Y_1) + I(X;Y_2) + d.$$

The maximum of $I(X; Y_1) + I(X; Y_2) = 0.6225562\ldots$ occurs when $P(X = 0) = \frac12$, and hence
substituting we obtain that $SR \le 0.36164288\ldots$

To show that it is indeed on the boundary of the achievable region consider the joint distribution
on $X$ and $W$ as follows:

$$p(W = 0) = p(W = 1) = \tfrac12, \quad p(X = 0|W = 0) = 0.15843497\ldots, \quad p(X = 0|W = 1) = 0.84156502\ldots$$

For this distribution, all inequalities reduce to equalities and an $SR$ of $0.3616\ldots$ is achieved.
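The constants in this evaluation can be reproduced with a short script. This is a sketch under stated assumptions: it uses $I(X;Y_1) = h(x/2) - x$ and $I(X;Y_2) = h((1-x)/2) - (1-x)$ with $x = P(X=0)$, a golden-section search as a stand-in for whatever maximization was used originally, and, for the achieving point, the assumed choice $U = X$ when $W = 0$ and $V = X$ when $W = 1$. All function names are illustrative.

```python
import math


def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def i_x_y1(x0):
    """I(X;Y1) for the BSSC as a function of x0 = P(X = 0)."""
    return h(x0 / 2) - x0


def i_x_y2(x0):
    """I(X;Y2) for the BSSC as a function of x0 = P(X = 0)."""
    return h((1 - x0) / 2) - (1 - x0)


def argmax_unimodal(func, lo, hi, iters=100):
    """Golden-section search for the maximizer of a unimodal function on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2
    for _ in range(iters):
        a = hi - inv_phi * (hi - lo)
        b = lo + inv_phi * (hi - lo)
        if func(a) < func(b):
            lo = a
        else:
            hi = b
    return (lo + hi) / 2


def achieved_sum_rate(x0):
    """Sum-rate for W uniform, P(X=0|W=0) = x0, P(X=0|W=1) = 1 - x0 (assumed U, V)."""
    i_w_y1 = i_x_y1(0.5) - 0.5 * (i_x_y1(x0) + i_x_y1(1 - x0))
    return i_w_y1 + 0.5 * i_x_y1(x0) + 0.5 * i_x_y2(1 - x0)


# The difference I(X;Y1) - I(X;Y2) is concave on [0, 1/2], so the search is valid there.
x_opt = argmax_unimodal(lambda x: i_x_y1(x) - i_x_y2(x), 0.0, 0.5)
d = i_x_y1(x_opt) - i_x_y2(x_opt)
sr_bound = (i_x_y1(0.5) + i_x_y2(0.5) + d) / 2

if __name__ == "__main__":
    print("x_opt =", x_opt, " d =", d)
    print("upper bound on SR =", sr_bound)
    print("achieved SR       =", achieved_sum_rate(x_opt))
```

The bound and the achieved value coincide, which is the sense in which the segment $R_1 + R_2 = 0.3616\ldots$ lies on the boundary of the inner bound.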



3.2 Sum-rate evaluations of the outer bounds for BSSC 
3.2.1 Case 1: Bound 2

To evaluate the maximum of the sum-rate of the outer bound (Bound 2) it was shown in [6] that it
suffices to consider $P(X = 0) = \frac12$. (It is immediate using the skew-symmetry of the channel and
the inherent symmetry of the outer bound expressions.)
The sum-rate maximum is hence given by

$$\max_{p(u,x),\, P(X=0)=\frac12} I(U;Y_1) + I(X;Y_2|U),$$

or in other words

$$\max_{p(u,x),\, P(X=0)=\frac12} I(X;Y_1) + I(X;Y_2|U) - I(X;Y_1|U).$$

Let $P(X = 0) = x$. In [7] it was shown that the curve $f(x) = I(X;Y_1) - I(X;Y_2) = h(\frac{x}{2}) -
h(\frac{1-x}{2}) + 1 - 2x$ is concave when $x \in [0, \frac12]$ and convex when $x \in [\frac12, 1]$. Further it was also shown that
the lower convex envelope$^5$ is given by

$$g(x) = \begin{cases} \frac{5x}{4}\, f\big(\frac45\big) & 0 \le x \le \frac45, \\ f(x) & \frac45 \le x \le 1. \end{cases}$$

From the definition of the lower convex envelope, we know that

$$I(X;Y_1|U) - I(X;Y_2|U) \ge g\big(\tfrac12\big),$$

and it is easy to see that equality is indeed achieved for a binary $U$.
Therefore

$$\max_{p(u,x),\, P(X=0)=\frac12} I(X; Y_1) + I(X; Y_2|U) - I(X; Y_1|U) = h\big(\tfrac14\big) - 0.5 - g(0.5) \approx 0.3725562\ldots$$

This corrects the implicit error made in [6] while calculating the lower convex envelope,
which led to a bound of $0.37111\ldots$
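The corrected value can be recomputed directly from the envelope. The sketch below assumes the envelope consists of the chord from the origin that is tangent to $f$ at $x = \frac45$, followed by $f$ itself on $[\frac45, 1]$; the tangency is checked numerically rather than taken on faith. Function names are illustrative.

```python
import math


def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def f(x):
    """f(x) = I(X;Y1) - I(X;Y2) for the BSSC with x = P(X = 0)."""
    return h(x / 2) - h((1 - x) / 2) + 1 - 2 * x


def g(x):
    """Assumed lower convex envelope of f: a line through the origin up to 4/5, then f."""
    if x <= 0.8:
        return 1.25 * x * f(0.8)
    return f(x)


def outer_bound_sum_rate():
    """h(1/4) - 1/2 - g(1/2): the maximum sum-rate of Bound 2 at P(X = 0) = 1/2."""
    return (h(0.25) - 0.5) - g(0.5)


if __name__ == "__main__":
    step = 1e-6
    slope_at_tangent = (f(0.8 + step) - f(0.8 - step)) / (2 * step)
    print("f'(4/5) ~", slope_at_tangent, " chord slope =", f(0.8) / 0.8)
    print("outer bound sum-rate =", outer_bound_sum_rate())
```

The derivative of $f$ at $\frac45$ matches the slope of the chord from the origin, which is what makes the line segment part of the convex envelope.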

3.2.2 Case 2: Körner-Marton Bound

To show that this sum-rate is still strictly inside the Körner-Marton [5] outer bound, observe that
we need to evaluate the union over $p(u, x)$ of

$$\begin{aligned}
R_1 &\le I(U; Y_1) \\
R_2 &\le I(X; Y_2) \\
R_1 + R_2 &\le I(U; Y_1) + I(X; Y_2|U).
\end{aligned}$$

Further, if a point $(R_1, R_2) = (a, a)$ belongs to this region then, by the skew-symmetry of the BSSC, it
will also belong to the union over $p(v, x)$ of

$$\begin{aligned}
R_1 &\le I(X; Y_1) \\
R_2 &\le I(V; Y_2) \\
R_1 + R_2 &\le I(V; Y_2) + I(X; Y_1|V),
\end{aligned}$$

and hence to the intersection of the two regions. The key difference between the bounds is that
while the former takes the intersection before the union, the latter takes the union prior to the
intersection.

5 More precisely, in [7] the upper concave envelope was characterized, and the characterization of the lower convex
envelope follows by symmetry.
Suppose we wish to compute

$$\max_{p(u,x)} I(X; Y_1) + I(X; Y_2|U) - I(X; Y_1|U);$$

then from the earlier discussion, this will be the maximum over $x \in [0, 1]$ of

$$h\big(\tfrac{x}{2}\big) - x - g(x).$$

It is easy to see that the global maximum will lie in $x \in [0, \frac45]$ (otherwise the maximum occurs
when $U$ is trivial and equals $I(X;Y_2)$). Taking derivatives we obtain that the maximum occurs when

$$\frac12 \log_2\frac{2 - x}{x} - 1 - \frac54 f\big(\tfrac45\big) = 0,$$

or

$$x^* = \frac{2}{1 + 2^c} \approx 0.4571429\ldots,$$

where $c = 2\big(1 + \frac54 f(\frac45)\big) \approx 1.7548875\ldots$
Thus the maximum sum rate is given by

$$\max_{p(u,x)} I(X;Y_1) + I(X;Y_2|U) - I(X;Y_1|U) \approx 0.3743955\ldots$$

The pair $(U, X)$ that achieves the maximum can be characterized by

$$P(U = 0) = 1 - a, \quad P(X = 0|U = 0) = 0, \quad P(U = 1) = a, \quad P(X = 0|U = 1) = \frac45,$$

where $0.8\,a = x^*$, or $a \approx 0.5714286\ldots$
Observe that for this choice

$$\begin{aligned}
I(U; Y_1) &= h\big(\tfrac{x^*}{2}\big) - a\,h(0.4) \approx 0.2206837\ldots \\
I(X; Y_2|U) &\approx 0.1537118\ldots \\
I(X; Y_2) &\approx 0.3006499\ldots
\end{aligned}$$

Therefore the point $(R_1, R_2) = (0.1871978\ldots, 0.1871978\ldots)$ lies on the boundary of the
Körner-Marton outer bound. In summary, the maximum sum rate given by the Körner-Marton outer bound
for the BSSC is $0.3743955\ldots$
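The numbers in this subsection follow from the closed-form stationary point and can be recomputed as below. This is a sketch that assumes the linear branch $g(x) = \frac{5x}{4} f(\frac45)$ of the envelope on $[0, \frac45]$ and the BSSC expressions $I(X;Y_1) = h(x/2) - x$, $I(X;Y_2) = h((1-x)/2) - (1-x)$; variable names are illustrative.

```python
import math


def h(x):
    """Binary entropy in bits, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def f(x):
    """f(x) = I(X;Y1) - I(X;Y2) for the BSSC with x = P(X = 0)."""
    return h(x / 2) - h((1 - x) / 2) + 1 - 2 * x


# Stationary point of h(x/2) - x - (5x/4) f(4/5) on [0, 4/5].
c = 2 * (1 + 1.25 * f(0.8))
x_star = 2 / (1 + 2 ** c)
a = x_star / 0.8  # P(U = 1), from 0.8 * a = x_star

# Maximum sum-rate of the Korner-Marton bound, evaluated on the linear branch of g.
km_sum_rate = h(x_star / 2) - x_star - 1.25 * x_star * f(0.8)

# Individual terms for the achieving pair (U, X):
# U = 0 forces X = 1, and U = 1 gives P(X = 0 | U = 1) = 4/5.
i_u_y1 = h(x_star / 2) - a * h(0.4)
i_x_y2_given_u = a * (h(0.1) - 0.2)
i_x_y2 = h((1 - x_star) / 2) - (1 - x_star)

if __name__ == "__main__":
    print("c =", c, " x* =", x_star, " a =", a)
    print("sum-rate =", km_sum_rate)
    print("I(U;Y1) =", i_u_y1, " I(X;Y2|U) =", i_x_y2_given_u, " I(X;Y2) =", i_x_y2)
```

The sum `i_u_y1 + i_x_y2_given_u` reproduces the maximum sum rate, and half of it gives the symmetric point $(R_1, R_2)$ quoted in the text.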

Conclusion

We establish an inequality for the binary skew-symmetric broadcast channel that was conjectured
in [7]. Thus we have quantified the gap between the outer bounds and the inner bounds for this
channel. It would be valuable to determine which of the bounds is weak and, if possible, to improve
them at least for this interesting channel.